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WEB PORTHOLES: USING WEB PROXIES TO CAPTURE AND 
ENHANCE DISPLAY REAL ESTATE 

FIELD OF THE INVENTION 

The present invention relates to the Internet and, more specifically, to providing 
5 unrequested information to users using the Internet. 

BACKGROUND OF THE INVENTION 

Client-server computer systems have been commonly used for various purposes. 
Typically, client computers or processes (or "clients") are separated from the servers and are 
connected to these servers over a network. The clients request information from one of the 
10 servers by providing a network address of the information. The server locates the 

information based on the provided network address and transmits the information over the 
network to the client, completing the transaction. 

THE INTERNET 

The World Wide Web, or the web, is a popular application of the client/server 
15 computing model FIG. 1 is a simplified block diagram of the relationship between elements 
in a web system 100. One or more web clients 10 are connected to a global information 
network 20. Each client 10 is a computer or a software process such as a browser program. 

Network 20, is the global data comprised of interconnected network devices residing 
in private intranets and public internets. For the purposes of this document, public internets 
20 and private intranets are treated as analogous, and the Network 20 is referred to by the 
commonly held name "Intemet. " 
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Portions of the Internet 20 may be owned and operated by different organizations, 
including, for example, private corporations, Internet Service Providers (ISPs), and backbone 
network carriers. These organization cooperate to provide global data connectivity. 

A web server 40 is likewise connected to the Internet 20. A web server 40 has one or 
5 more internet network addresses and textual host names that are associated in an agreed-upon 
format. Web servers 40 contain multimedia information resources, such as documents and 
images, to be provided to clients 10 on request. Web servers 40 may additionally or 
altematively contain software for dynamically generating such resources. 

Clients 10 and servers 40 communicate using one or more agreed-upon protocols that 
10 specify the format of the information that is communicated. A client 10 establishes a logical 
network connection to a particular server 40 using a communication protocol called the 
Hypertext Transfer Protocol (HTTP). A client 10 may use HTTP to communicate directly to 
web servers 40, or through one or more HTTP intermediary devices known as proxy or 
gateway servers. 

1 5 A Uniform Resource Locator (URL) uniquely identifies each information object 

stored on or dynamically generated by servers 40. A URL is a form of network address that 
identifies the location of information stored on a host server in a network. Information 
documents provided by web servers 40 to clients 10 are encoding in a rich and continually 
growing set of multimedia formats. One of the more popular formats of these information 

20 documents is HTML. HTML documents are commonly referred to as web pages. 

PROXY SERVERS 
Web clients 10 communicate with web servers 40 to perform transactions and 
transport documents. While web clients 10 can communicate directly with web servers 40, it 
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is becoming increasingly popular for web clients 10 to communicate with web servers 40 
through one or more intermediary proxy servers. A proxy server performs a transaction on a 
client's behalf 

hi one arrangement, as shown in FIG. 1, a proxy server 30 is logically interposed 
5 between clients 10 and server 40. Proxy server 30 provides a "middleman" gateway service 
between clients 10 and servers 40. Proxy server 30 receives the client's request, and 
forwards the request to the servers 40. Proxy server 30 then receives the corresponding 
response from servers 40 and passes it back to clients 10. 

Some proxy servers 30 may be explicitly visible to clients 1 0 (requiring client . 
1 0 configuration and participation to use the proxy server). Other proxy servers 30 may be 

transparent to the clients, intercepting the traffic as it flows by. In either case, a proxy server 
30 provides the same middleman function, regardless of client awareness of the proxy server 
30. 

Some proxy servers 30 forward requests and responses untouched. Other proxy 
1 5 servers 30 may subtly or heavily transform the request and response content as part of its 

middleman function. As an example, proxy servers 30 may be used to filter offensive content 
before being shown to an underage audience. As another example, proxy servers 30 may 
transform web pages between natural languages, to provide a portable multi-lingual browsing 
experience. 

20 PROXY SERVERS AND INTERNET SERVICE PROVIDERS 

In one embodiment, proxy servers 30 are used by the Internet Service Provider (ISP), 
such as America On Line, Prodigy, MSN, Excite@Home, AT&T Worldnet, UUNET, etc. 
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These ISPs provide services for users who use clients 10 to get access to servers 40. In this 
context, a server 40 may be referred to as an origin server 40. 

Typically, a user, connecting to the Internet, uses client 10 and dials in to an ISP (or 
has a permanent connection to the ISP). The ISP, as a middle man, typically does not have 
the information the user wants. This information is stored in any of the origin servers 40. 
When the user makes a request for the information, the request goes to the ISP, which 
forwards the request to one of the origin servers 40. Origin servers 40 then provide the 
requested information in the form of web pages to the ISP, and the ISP forwards the web 
pages to the user. 

In brief, everything the user requests or receives goes through the ISP or the ISP's 
proxy server 30. The proxy server 30 acts as a centralized intermediary, and is capable of 
observing all of the incoming and outgoing HTTP traffic flows. 



In one embodiment, proxy servers 30 are used inside corporate information networks, 
especially at a point of traffic aggregation, or gateways between geographic divisions or the 
greater Internet. Analogous to the Internet Service Provider model, a client 10 inside a 
corporate intranet may access a server 40 inside the intranet, or on the greater intemet, 
through the services of an intermediary proxy server 30. 



Proxy servers 30, in addition to serving as the pipeline between clients 10 and the 
Intemet, may also implement caching techniques, by storing replicas of popular information 
documents in a proximate local storage. Through the use of caching, proxy servers 30 can 



PROXY SERVERS AND CORPORATE INTRANETS 



CACHING PROXY SERVERS 
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quickly provide replicas of information to clients 10 from the proximate local storage, 
reducing delays and internet traffic to the more distant origin servers. 

Proxy servers 30 receive requests for resources that are directed from clients 10 to 
servers 40. When the cache in proxy servers 30 has a replica of the requested resource that 
meets certain freshness and applicability constraints, proxy servers 30 respond to the clients 
10 and serve the resource directly. In this arrangement, the number and volume of data 
transfers between a proxy server 30 and origin servers 40 are greatly reduced. As a result, 
network resources or objects are provided more rapidly and at lower cost to clients 10. 

THE WEB PAGES 

Users typically use a web browser application, such Netscape Conmiunicator, 
Microsoft Explorer, etc., on a client 10 to navigate through the Intemet 20, and to perform 
transactions with various origin servers 40 for desired information. Browsers are computer 
programs that are designed to request items from the network, and to decode and display the 
requested items when they are provided to clients 10. 

The user interface of a browser includes, among other things, (1) a content display 
region, and (2) a control region. The control region of a browser may contain any number 
and type of controls that allow a user to access the functionality of the browser. The content 
display region of a browser is a region dedicated to the display of content retrieved by the 
browser. 

Users typically receive information from the web in form of web pages that are 
displayed in the content display region of the browser. Web pages have a hierarchical 
architecture. Each web page may consist of various items, where each distinct item must be 
separately fetched. Items that are parts of a web page are referred to herein as "embedded" 
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items. The embedded items of a single web page may even reside on different origin servers 
40 on the Intemet 20. An embedded item may be, for example, a digital image, a sound file, 
or a link to another web page. Embedded items may themselves be web pages. For example, 
a web page may specify (1) a set of fi^ames and (2) web pages that are to populate those 
frames. 

The browser interprets the HTML representing a web page, and in response to 
decoding tags to embedded items, sends requests to retrieve the embedded items to 
"complete" the generation of the web page. From the HTML data, the browser knows how to 
get each embedded document fi-om the URL of the document. Since an embedded item may 
itself include embedded items, the browser recursively requests the embedded items until the 
document is fully rendered. 

AFFINITY MERCHANDISING, BRANDING, AND SCREEN REAL ESTATE 
As users of the Intemet 20 increase, numerous commercial parties seek to extract 
commercial value by marketing to users as they browse the Intemet. These commercial 
parties seek to exploit two factors of Intemet browsing: (1) the interests of users can be 
inferred by their Intemet browsing behavior, and (2) these interests can be monetized by 
targeting merchandizing to the user (displaying advertisements, promotions, or other 
marketing vehicles in sections of the user's web browser). The web browser in effect acts as 
"billboard" space for paid marketing and merchandising placements. 



Origin server-generated advertisements are commonly visible to today's Intemet 
users. Typically, these advertisements are included as part of the web pages that the users 
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request. As the users receive and watch the requested web pages, the users also see the 
advertisements in the web pages. Because the advertisements are embedded items within the 
web page, they are displayed as part of the web page within the content display region of the 
browser. In addition, because the advertisements are part of the web pages, they are 
5 controlled by the parties that control the web pages (the "content providers"). The content 
providers therefore obtain the revenue that the sale of such advertising generates. 

hi this scenario, the organization responsible for the origin server 40 desires clients to 
frequently visit their web server as opposed to competitive properties, for two reasons. First, 
the origin server organization receives compensation for delivering merchandising campaigns 

1 0 to the user. The more the user visits the origin server 40, the higher the revenue. Second, the 
origin server organization can charge higher rates for use of its display real estate if it can 
collect more demographic information about the user. If the user is more active on the origin 
server 40, the origin server 40 can collect more information about the user's overall interests. 
In both cases, the origin server 40 is attempting to increase the user's "affinity" to the 

1 5 web site, because greater affinity implies greater revenue potential. Because users are not 
forced to visit any particular web site, web sites compete fiercely with brand and experience 
to capture "loyal" user communities. 

Origin servers 40 that attempt to attract and retain users through the use of brand and 
services are conventionally called "portals." The goal of such servers is to become the single 

20 destination for Intemet transactions, thus maximizing customer affinity. 

CLIENT PORTALS 
In another arrangement, the user is provided a customized browser that displays 
advertisements, promotions, and branding decoration either within the browser, or in an 
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alternative section of display screen real estate (often floating, always-on-top windows). 
However, users generally prefer to choose their own browsers, and therefore are only enticed 
to use such custom browser software through incentives such as free Internet access. 
Additionally, the inconvenience and cost of managing software distribution to end consumers 
5 makes customized browsers an undesirable solution in many situations. 

In a third arrangement, a user browses to a specific origin server 40 which provides an 
application gateway for browsing other web sites. The origin server implements a kind of 
"browser in a server," providing server-side controls that let the users specify other sites that 
the users want to access. As the users use the controls of the control site to browse the 
£1 1 0 desired web pages, advertisements are sent to the users from the controller of the control site. 



As with all server solutions, users can typically circumvent the control site by pointing their 



browser software to an altemative origin server, thus avoiding the advertisements sent. 



Similar to origin server portals, browsers customized to support affinity 



merchandising act as a kind of "client-side" portal — obtaining a captive audience, and 



1 5 monetizing captured screen real-estate. 
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SUMMARY OF THE INVENTION 

Techniques are disclosed for allowing a proxy server to deliver unrequested content to 
users browsing the Internet. In such a configuration, the proxy server is called a "porthole 
engine." 

In one embodiment, a user requests a web page. In response, the user receives the 
requested web page accompanied by unrequested information that an organization wishes the 
user to see. The requested web page and the unrequested information are displayed on the 
user's display, either within the browser, or as part of a separate window. In one 
embodiment, the requested web page and the unrequested information are displayed in 
fi'ames. For example, the porthole engine responds to requests for web pages by sending 
firame data to the user, where the fi"ame data (1) defines fi^ames, and (2) specifies embedded 
items that are to populate the frames, where one of the embedded items is the originally- 
requested web page. 

In one embodiment, the unrequested information may be implemented according the 
capabilities of the particular browser software, customized to the browsing consumer, 
targeted to explicit and inferred characteristics of the consumer, and branded to the 
organization providing access. 

Transformational proxies deployed to capture and monetize display real estate are 
referred to herein as "Web Portholes." The user browses the web as usual, but all 
transactions pass through the centralized porthole engine acting as a "porthole" to the web. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example, and not by way of limitation, 
in the figures of the accompanying drawings and in which like reference numerals refer to 
similar elements and in which: 

FIG. 1 is a simplified block diagram of the relationship between elements in a web 

system; 

FIG. 2 shows an exemplary content display area of a browser of a client after a user 
requests a web page; 

FIG. 3 shows the event sequences in which a web page is requested and delivered 
with unrequested information, using the fi-ame- wrapping technique; and 

FIG. 4 is a block diagram of a computer upon which an embodiment of the invention 
may be implemented. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Techniques are provided for allowing a proxy server to deliver unrequested content to 
users browsing the Internet. In this configuration, the proxy server is termed a "porthole 
engine." 



In one embodiment, in response to a request from a user for an information document, 
porthole engine 30 receives the virgin response from the origin server 40, scans the response, 
and sends a modified response back to a client 10 which "captures" a portion of display real 
estate, and populates it with new documents, typically containing branding and 
merchandising campaigns. 

Typically, the captured screen real estate may be within the browser's content window 
or within auxiliary windows. In both cases, the screen real estate is captured by modifying 
the information documents as they pass through porthole engine 30. Porthole engine 30 
supports various mechanisms for capturing display real estate, four of which are described 
herein. These four mechanisms may used separately or one in conjunction with another. 



1 . Modifying web page content to encapsulate the original content within a 
paneled "frame"; the other panels may be used as captured real-estate. 

2. Modifying the web page content to reserve space at the top or bottom of 
the web page as captured real-estate. 

3. Modifying the web page content to pop-up auxiliary display windows for 
use as captured display real-estate. 

4. Modifying the web page content to replace existing content (graphics or 
text) with replacement content. 



DELIVERING THE REQUESTED WEB PAGE 
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Because browser clients have different abilities, and because information documents 
contain various features, some of which may conflict with the screen-real-estate-claiming 
modifications, porthole engine 30 needs to rewrite information content being aware of the 
capabilities of the browser, and the extant features in each information document. 

Because different users of the solution will have different branding and conunercial 
requirements, porthole engine 30 implements a general and parameterizable page-rewriting 
system that can apply different techniques for screen real-estate capture and different 
unrequested content for different consumers. 

FRAME-WRAPPING 
In one embodiment, porthole engine 30 implements display real-estate acquisition and 
population by 'trapping" the target content page with additional frames which can be used 
for unrequested information. In such an embodiment, in response to a request from a user for 
a web page, porthole engine 30 does not initially retum to the user the requested page, but 
instead returns a totally different document in the form of frameset scaffolding data. This 
frameset data identifies various embedded items, including the original requested web page, 
and indicates how these documents are to be displayed in frames within the content display 
area of the user's browser. The requested web page is thus displayed in only a portion of the 
content display area. 

FIG. 2 shows an exemplary content display area 200 of a browser of a client 10 after a 
user requests a web page, such as, for example, http://www.cajun-gifts.com/. In response to 
the request, porthole engine 30 delivers to the browser a web page that includes frame data 
which, when interpreted by the browser, causes the browser to divide the content display area 
200 into three distinct regions: area 204, area 208-1 and area 208-2. The frame data also 

-12- 

50269-036 

SJS 24962-11.050269.0036 



m 



includes code that instructs the browser how to populate the three areas, hi the illustrated 
example, areas 204, 208-1 and 208-2 are respectively populated by http://www.cajun- 
gifts.com/, http://some-isp.net/porthole/framel.html and http://some- 
isp.net/porthole/frame2.html. 
5 Area 204 displays the content of http://www.cajun-gifts.com/, which was initially 

requested by the user, and requested a second time as an "embedded item" within the framed 
web page. Areas 208-1 and 208-2 display the unrequested information. The unrequested 
information may be any form of content that was not requested by the user, such as 
advertisements. While the unrequested information is '^inrequested'* relative to the user, the 
10 unrequested data is requested by the browser in response to the browser decoding the frame 
data. 

The unrequested content that is sent to the user is specified in the frame data that is 
sent to the user, and the frame data is sent by porthole engine 30. Consequently, the 
controller of porthole engine 30 also controls which unrequested information is sent to the 
1 5 user. Typically, the party that controls porthole engine 30 is the ISP or the corporation 
running an intranet. 

PAGE MODIFICATION 
The above section discussed the use of frame-wrapping to acquire and populate 
screen real estate. In one embodiment, porthole engine 30 also support HTML rewriting to 
20 append objects to the header/footer of a document, to replace objects within a document, and 
to generate pop-up windows. 

All these techniques are implemented by porthole engine 30 by modifying the 
response documents to attach new document components. 
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SELECTING AND TARGETING UNREQUESTED INFORMATION 
The proxy owner has available to it information about users, and porthole engine 30 
has available to it information about the documents requested by users. Consequently, 
porthole engine 30 may be configured to select the unrequested content to be sent to users 
5 that is tailored to the particular users and/or the particular interests reflected by the content 
requested by users. 

In one embodiment, the unrequested information, or the content of display areas 208, 
is related to the content of http://www.cajun-gifts.com/. For example, if http://www.cajun- 
gifts.com/ is about Louisiana culture, then display areas 208 may display information 
10 interesting to consumers who appreciate Louisiana culture, including for example, 

advertisements for Jazz music CDs, Creole cookbooks, Mardi Gras travel packages, etc. In 
one embodiment, the content of display areas 208 are other web pages, often hosted by the 
organization running the porthole engine 30, such as, http://some- 
isp.net/porthole/fi-amel.html and http://some-isp.net/porthole/frame2.html, etc. 

1 5 EVENT SEQUENCES IN FRAME- WRAPPING: RECEIVING THE REQUESTED WEB 

PAGE AND THE UNREQUESTED INFORMATION 
FIG. 3 shows the event sequences for frame-wrapping in which http://www.cajun- 
gifts.com/ is requested and delivered with the unrequested information provided by 
http://some-isp.net/porthole/framel .html and http://some-isp.net/porthole/frame2.html. 
20 Columns A, B, and C correspond to a client 10, a porthole engine 30, and various servers 40, 
respectively. 

In step 1, a user, via a web browser, sends an initial request for http://www.cajun- 
gifts.com/ to porthole engine 30. In response, porthole engine 30 in step 2 sends back to the 
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user frameset data. This frameset data generates an encapsulation of the original target page 
with panels containing the unrequested information. 

The frameset data identifies http://www.cajun-gifts.com/, http://some- 
isp.net/porthole/framel.html, and http://some-isp.net/porthole/frame2.html to be displayed in 
5 content display area 200. This frameset data also indicates how the documents 

http://www.cajun-gifts.com/, http://some-isp.net/porthole/framel.html, and http://some- 
isp.net/porthole/frame2.html are to be displayed. As shown in FIG. 2, http://www.cajun- 
gifts.com/, http://some-isp.net/porthole/framel.html, and http://some- 
isp.net/porthole/frame2.html are displayed in areas 204, 208-1, and 208-2, respectively. 
10 In step 3, the browser decodes the frameset data and, upon decoding the tags for the 

embedded items, sends requests for http://www.cajun-gifts.com/, http://some- 
isp.net/porthole/framel.html, and http://some-isp.net/porthole/frame2.html to porthole engine 
30. 

In contrast to the request for http://www.cajun-gifts.com/ issued in step 1, the request 

15 for http://www.cajun-gifts.com in step 3 is requesting http://www.cajun-gifts.com/ as an 

embedded item. As shall be described in greater detail below, techniques are provided which 

allow porthole engine 30 to distinguish initial requests from requests for porthole-embedded 

items. The ability to distinguish initial requests from requests for porthole-embedded items is 

important because, if all requests were treated as initial requests, porthole engine 30 would 

20 always respond by sending frame data and the user would never receive the actual content 

that the user desires. 

In step 4, porthole engine 30 forwards requests for http://www.cajun-gifts.com/, 

http://some-isp.net/porthole/framel.html, and http://some-isp.net/porthole/frame2.html to 

appropriate origin servers 40. In step 5, each of the servers 40 that receive corresponding 
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requests for http://www.cajim-gifts.com/, http://some-isp.net/porthole/framel.html, and 
http://some-isp.net/porthole/frame2.html, sends the requested information to porthole engine 
30, which, in step 6, forwards the information to be displayed within the content display 
region of the browser from which the initial request was issued. 

As the browser receives the information in step 6, the browser determines whether 
any of the retrieved items include addition embedded items. If there are additional embedded 
items to be retrieved, then the browser requests and receives the additional embedded items, 
as in steps 3-6 until there is no more embedded items to be retrieved. 

DISTINGUISHING REQUESTS FOR PORTHOLE-EMBEDDED ITEMS 

REWRITING LINKS AND URLS 

In one embodiment, initial requests are distinguished from requests for embedded 
items based upon information contained in the URLs associated with the requests. 
Specifically, in response to an initial request for an item, porthole engine 30 constructs the 
web page that includes the frame data (the "frame data page"). The frame data page is sent 
back to the client 10 in response to the initial request. The frame data page specifies the 
initially-requested item as an embedded item. Typically, the initially-requested item would 
be identified as an embedded item by including a tag to the initially-requested item within the 
frame data page, where the tag includes the URL of the initially-requested item. 

According to one embodiment, the tag for the initially-requested item that is stored in 
the frame data web page includes a "fat" URL. A "fat" URL is a URL that includes data in 
addition to the address of the resource associated with the URL. According to one 
embodiment, the fat URL that is stored in the frame data page for the initially requested item 
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includes (1) the address of the initially requested data, and (2) an embedded-item flag that 

indicates to porthole engine 30 that the request is for an embedded item. 

When the browser decodes the tag containing the fat URL during the decoding of the 

frame data page, the browser will send a request to porthole engine 30 to retrieve the initially 

5 requested item. That request will contain the embedded-item flag from fat URL. In response 

to detecting that a request contains the embedded-item flag, porthole engine 30 retrieves the 

requested data item as normal, without responding with a frame data page. 

The initially requested item may itself include links to one or more embedded items. 

When the browser decodes the initially requested item, the browser will send requests for 

10 those embedded items. Li response to those requests, the browser should retrieve the 

requested data items rather than respond with a frame data page. According to one 

embodiment, porthole engine 30 distinguishes subsequent requests by ensuring that they too 

contain the embedded-item flag. 

To ensure that all requests for embedded items contain the embedded-item flag, 

1 5 porthole engine 30 inspects web pages prior to delivering them to the browser. If a web page 

contains tags to embedded items, those tags are rewritten to include the embedded-item flag. 

Consequently, when the tags are decoded by the browser, the browser responds by issuing 

requests that include the embedded-item flag. 

For browsers that support the HTTP Referer tag, HTML rewriting of embedded tags 

20 for the purpose of detecting embedded pages may be eliminated. When the browser fetches 

embedded documents from a page already tagged as being frame-wrapped, the Referer field 

is set to the URL of the frame-wrapped parent. Porthole engine 30 can recognize these pages 

as also frame-wrapped, can attach the embedded-item flag, and send a "redirect" message 

back to the browser asking it to fetch the newly tagged URL. This Referer/redirect technique 
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is an alternative to rewriting the HTML in the porthole engine 30. It provides the same result 
of tagging URLs as porthole-embedded. 

In one embodiment, porthole engines 30 chooses to rewrite the links in embedded 
framed documents to affect frame behavior. For example, the TARGET attribute of a link 
controls which frame a new document will be loaded into. Porthole engine 30 chooses to 
rewrite links, for example, to set the TARGET to be the entire browser contents window. 
This will cause the full contents window to be redrawn, the porthole frame contents to be 
refreshed, and the top-level URL display on the browser toolbar to be updated. 

For illustration purposes, simplified pseudocode for the essence of frame wrapping is 



^ 10 shown here: 

.■'=?=: 



While (1) 
{ 

// (1) proxy read the next HTTP request 
request = http_read (client_socket) ; 



// (2) If the browser is requesting a top-level URL, instead of 
20 // loading a page from within a porthole, send back a frameset 

// to grab display real-estate, instead of sending back the page. 

in_porthole = {has_embedded_tag (request . url) || has_embedded_tag (request . re ferer) ) j 
if (! in_porthole) 

25 ( 

tagged_url = add_embedded_tag (request. url) ; 
frameset = build_frameset(tagged_url) ; 
http_msg = http_build_response ( frameset) ; 
http_write (client_socket, http_msg) ; 

30 ) 



-18- 

50269-036 

SJS 24962-11.050269.0036 



// otherwise, the URL request originated from within a porthole. There 
// are two possibilities: (3) the requested URL is tagged as being in 
// the porthole (done if HTML rewriting is performed), and (4) the 
// requested URL is not tagged as being in a porthole (happens when the 
// redirect mode is used instead of HTML rewriting. 

// (3) If the referring parent is tagged as being inside a porthole, 
// but the requested URL is not tagged. This only happens in 
// the optional mode where HTML proxy rewriting is not done. In 
// this case,' we tag the URL and send a redirect back to the 
// client, asking it to fetch the new URL instead. 

else if (has_embedded_tag (request . referer) && (! has_embedded_tag (request . url) ) ) 
{ 

tagged_url = add_erabedded_tag ( request , url) ; 
http_msg = http_build_redirect (tagged_url) ; 
http_write (client_socket, http_msg) ; 

} 

// (4) The requested URL is tagged as being inside a porthole. 

// Since the porthole already exists, we want to send back the 

// real page contents, and NOT wrap it in another frameset. 

else // case (4) 

{ 

url = remove_embedded_tag (request . url) ; 
server_socket = http_connect (url . server, url .port ) ; 
http_msg = http_build_request (url) ; 
http_write (serve r_socket, http_msg) ; 
response_msg = http_read (server_socket ) ; 

rewrite_html (response_msg) ; 

http_write (client_socket, response_msg) ; 

} 

} 
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PRACTICAL CONSIDERATIONS 
In practice, differences in browsers and in page components can cause rewritten pages 
to not behave correctly on certain revisions of certain vendor's browser software. A practical 
deployment of web portholes will need to recognize incompatibilities between document 
constructs and browser capabilities, and act accordingly. In some cases, porthole engine 30 
may decide to replace one version of transformation (e.g. frame-wrapping) with another 
version of transformation (e.g. pop-up window) when such incompatibilities exist of are 
suspected. 

For example, some browsers do not support frames. Others do not support pop-up 
windows. Other browsers have deficiencies or idiosyncrasies that porthole engines 30 will 
need to work around. 

Similarly, some form of page content may interfere with the porthole engine 30. For 
example, some HTML pages include client-side active languages such as "Javascript" which 
attempt to manipulate frames. These constructs may conflict with the porthole engine's 
effort to perform frame- wrapping. In one embodiment, porthole engine 30 will attempt to 
detect these conflicts, and either defuse the conflicts by rewriting the HTML, by applying an 
alternate real-estate capture technique, or by disabling the porthole for conflicting pages. 

In some cases, it is difficult to detect all incompatibilities a priori. In one 
embodiment, the transformational proxy has a list of manual exceptions for servers, clients, 
and web pages that should be processed in a different way, or left entirely untouched. 

PERSONALIZATION 
In one embodiment, porthole engine 30 distinguishes individual client users, and uses 
the user identification to personalize the captured screen real-estate. The user identity can be 
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used to modify the look and feel of the display, to select among services offered to clients, 
and to augment the methods used to target advertisements and merchandising offers to the 
user. 

Four techniques for identifying users are used in various embodiments of the 
invention, and are described as follows: 

1 . Cookies — porthole engine 30 can attach cookies to the pages the user 
browses to identify the user. Cookies are client-side state passed from 
browser to server in HTTP header information, and writable by the 
porthole engine 30. 

2. Radius authentication — when using telephone dial-up connections, the 
user authenticates with a user name and password before the network 
connection is established. This authentication is conraionly performed and 
accoimted using a protocol called RADIUS. By observing the RADIUS 
authentication transactions and the resulting network address assigned to a 
client 10, porthole engine 30 can relate the network address (visible on all 
client communication) with the underlying usemame. 

3. IP tracking — in environments where users are assigned particular 
network addresses, and in environments where a network address to user 
identity database is available, porthole engine 30 can identify the user from 
its network address. 

4. Proxy authentication — porthole engine 30 can perform its own challenge- 
response authentication, prompting the user for a usemame and password 
before providing access through the porthole. This technique is 
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implemented using HTTP proxy authentication machinery which is part of 
the HTTP standard. 




; 



BRANDING AND VERTICAL PORTHOLES 
In one embodiment, porthole engine 30 is parameterizable to yield a variety of 
5 different user experiences. A single instance of a porthole engine may simultaneously 
provide an arbitrary number of distinct visual appearances and functional behaviors. For 
example, one set of users may be presented with a pop-up advertisement, while another set of 
users may be presented with a top-of-the-page search engine control panel. This functionality 
allows the deployment of virtual, vertical porthole engines, each with a different look and 
10 feel. 

The owner of a porthole engine 30 can offer a variety of distinctively branded, 
distinctively functional browsing experiences, using the same proxy server, through 
customization. 

CENTRALIZED INTERMEDIARY BROWSING ENHANCEMENTS 
15 Because porthole engine 30 acts as a centralized intermediary, processing all 

transaction flows, it offers the ability to provide other value-added services to the browsing 
experience. 

In one embodiment, porthole engine 30 is coupled with databases to provide the 
ability to fill-in popular form fields (such as credit card ordering fields) automatically. As 
20 HTML documents pass through porthole engine 30, named fill-in form fields are seen by 
porthole engine 30. If an HTML page is registered in porthole engine 30 as "fillable," 
porthole engine 30 will automatically insert default information into the fields, including, but 
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not restricted to, name, address, and credit card payment information. Porthole engine 30 
allows the completion of forms from any server, without requiring any client software. 

In one embodiment, a porthole engine 30 is coupled with content filtering technology 
to restrict access to particular web pages. Such filtering is parameterized by the user's 
identity. 

HARDWARE OVERVIEW 

FIG. 4 is a block diagram that illustrates a computer system 400 upon which an 
embodiment of the invention may be implemented. For example, computer system 400 may 
implement a client 10, a porthole engine server 30, or an origin server 40, configured to 
operate as described above. Computer system 400 includes a bus 402 or other 
conmiunication mechanism for communicating information, and a processor 404 coupled 
with bus 402 for processing information. Computer system 400 also includes a main memory 
406, such as a random access memory (RAM) or other dynamic storage device, coupled to 
bus 402 for storing information and instructions to be executed by processor 404. Main 
memory 406 also may be used for storing temporary variables or other intermediate 
information during execution of instructions to be executed by processor 404. Computer 
system 400 fixrther includes a read only memory (ROM) 408 or other static storage device 
coupled to bus 402 for storing static information and instructions for processor 404. A 
storage device 410, such as a magnetic disk or optical disk, is provided and coupled to bus 
402 for storing information and instructions. 

Computer system 400 may be coupled via bus 402 to a display 412, such as a cathode 
ray tube (CRT), for displaying information to a computer user. An input device 414, 
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including alphanumeric and other keys, is coupled to bus 402 for communicating information 
and command selections to processor 404. Another type of user input device is cursor 
control 416, such as a mouse, a trackball, or cursor direction keys for communicating 
direction information and command selections to processor 404 and for controlling cursor 
movement on display 412. This input device typically has two degrees of freedom in two 
axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify 
positions in a plane. 

The invention is related to the use of computer system 400 for implementing the 
techniques described herein. According to one embodiment of the invention, those 
techniques are implemented by computer system 400 in response to processor 404 executing 
one or more sequences of one or more instructions contained in main memory 406. Such 
instructions may be read into main memory 406 from another computer-readable medium, 
such as storage device 410. Execution of the sequences of instructions contained in main 
memory 406 causes processor 404 to perform the process steps described herein. ]n 
alternative embodiments, hard-wired circuitry may be used in place of or in combination with 
software instructions to implement the invention. Thus, embodiments of the invention are 
not limited to any specific combination of hardware circuitry and software. 

The term "computer-readable medium" as used herein refers to any medium that 
participates in providing instructions to processor 404 for execution. Such a medium may 
take many forms, including but not limited to, non- volatile media, volatile media, and 
transmission media. Non-volatile media includes, for example, optical or magnetic disks, 
such as storage device 410. Volatile media includes dynamic memory, such as main memory 
406. Transmission media includes coaxial cables, copper wire and fiber optics, including the 
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wires that comprise bus 402. Transmission media can also take the form of acoustic or light 
waves, such as those generated during radio-wave and infra-red data commimications. 

Common forms of computer-readable media include, for example, a floppy disk, a 
flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD-ROM, any other 
5 optical medium, punchcards, papertape, any other physical medium with patterns of holes, a 
RAM, a PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a 
carrier wave as described hereinafter, or any other medium from which a computer can read. 

Various forms of computer readable media may be involved in carrying one or more 
sequences of one or more instructions to processor 404 for execution. For example, the 
Q 1 0 instructions may initially be carried on a magnetic disk of a remote computer. The remote 

fyl computer can load the instructions into its dynamic memory and send the instructions over a 

42 

4= telephone line using a modem. A modem local to computer system 400 can receive the data 

M 

fll on the telephone line and use an infra-red transmitter to convert the data to an infra-red 

%^ signal. An infra-red detector can receive the data carried in the infra-red signal and 

.J. 

21 1 5 appropriate circuitry can place the data on bus 402. Bus 402 carries the data to main memory 

406, from which processor 404 retrieves and executes the instructions. The instructions 

received by main memory 406 may optionally be stored on storage device 410 either before 

or after execution by processor 404. 

Computer system 400 also includes a conmiunication interface 418 coupled to bus 

20 402. Communication interface 418 provides a two-way data communication coupling to a 

network link 420 that is connected to a local network 422. For example, communication 

interface 418 may be an integrated services digital network (ISDN) card or a modem to 

provide a data conmiunication connection to a corresponding type of telephone line. As 

another example, communication interface 418 may be a local area network (LAN) card to 
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provide a data communication connection to a compatible LAN. Wireless links may also be 
implemented. In any such implementation, communication interface 418 sends and receives 
electrical, electromagnetic or optical signals that carry digital data streams representing 
various types of information. 
5 Network link 420 typically provides data communication through one or more 

networks to other data devices. For example, network link 420 may provide a connection 
through local network 422 to a host computer 424 or to data equipment operated by an 
hitemet Service Provider (ISP) 426. ISP 426 in tum provides data communication services 
through the world wide packet data communication network now commonly referred to as 

10 the "Internet" 428. Local network 422 and Intemet 428 both use electrical, electromagnetic 
or optical signals that carry digital data streams. The signals through the various networks 
and the signals on network link 420 and through communication interface 418, which carry 
the digital data to and from computer system 400, are exemplary forms of carrier waves 
transporting the information. 

15 Computer system 400 can send messages and receive data, including program code, 

through the network(s), network link 420 and communication interface 418. In the Intemet 
example, a server 430 might transmit a requested code for an application program through 
Intemet 428, ISP 426, local network 422 and communication interface 418. In accordance 
with the invention, one such downloaded application implements the techniques described 

20 herein. 

The received code may be executed by processor 404 as it is received, and/or stored in 
storage device 410, or other non- volatile storage for later execution. In this manner, 
computer system 400 may obtain application code in the form of a carrier wave. 
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In the foregoing specification, the invention has been described with reference to 
specific embodiments thereof It will, however, be evident that various modifications and 
changes may be made thereto without departing fi'om the broader spirit and scope of the 
invention. The specification and drawings are, accordingly, to be regarded in an illustrative 
rather than a restrictive sense. 
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